Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry

Authors

  • Sunil Gupta
  • Truyen Tran
  • Wei Luo
  • Dinh Phung
  • Richard Lee Kennedy
  • Adam Broad
  • David Campbell
  • David Kipp
  • Madhu Singh
  • Mustafa Khasraw
  • Leigh Matheson
  • David M Ashley
  • Svetha Venkatesh
Abstract

OBJECTIVES: Using the prediction of cancer outcome as a model, we have tested the hypothesis that through analysing routinely collected digital data contained in an electronic administrative record (EAR), using machine-learning techniques, we could enhance conventional methods in predicting clinical outcomes.

SETTING: A regional cancer centre in Australia.

PARTICIPANTS: Disease-specific data from a purpose-built cancer registry (Evaluation of Cancer Outcomes (ECO)) from 869 patients were used to predict survival at 6, 12 and 24 months. The model was validated with data from a further 94 patients, and results compared to the assessment of five specialist oncologists. Machine-learning prediction using ECO data was compared with that using EAR and a model combining ECO and EAR data.

PRIMARY AND SECONDARY OUTCOME MEASURES: Survival prediction accuracy in terms of the area under the receiver operating characteristic curve (AUC).

RESULTS: The ECO model yielded AUCs of 0.87 (95% CI 0.848 to 0.890) at 6 months, 0.796 (95% CI 0.774 to 0.823) at 12 months and 0.764 (95% CI 0.737 to 0.789) at 24 months. Each was slightly better than the performance of the clinician panel. The model performed consistently across a range of cancers, including rare cancers. Combining ECO and EAR data yielded better prediction than the ECO-based model (AUCs ranging from 0.757 to 0.997 for 6 months, AUCs from 0.689 to 0.988 for 12 months and AUCs from 0.713 to 0.973 for 24 months). The best prediction was for genitourinary, head and neck, lung, skin, and upper gastrointestinal tumours.

CONCLUSIONS: Machine learning applied to information from a disease-specific (cancer) database and the EAR can be used to predict clinical outcomes. Importantly, the approach described made use of digital data that is already routinely collected but underexploited by clinical health systems.
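The outcome measure above, AUC with a 95% CI, can be illustrated with a minimal Python sketch. Everything in it is an assumption for illustration: the toy labels and scores are made up, the function names `auc` and `bootstrap_ci` are hypothetical, and the percentile bootstrap is only one common way to obtain an interval; the abstract does not say how the authors computed theirs.

```python
import random

def auc(labels, scores):
    """AUC as the probability that a randomly chosen positive case
    scores higher than a randomly chosen negative case (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (an assumed method,
    not necessarily the one used in the study)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # skip resamples that contain only one class
        stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Toy data (hypothetical): 1 = survived to the horizon, 0 = did not.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2, 0.7, 0.6]

point = auc(labels, scores)
lower, upper = bootstrap_ci(labels, scores)
print(f"AUC = {point:.3f}, 95% CI {lower:.3f} to {upper:.3f}")
```

On this toy data, 14 of the 16 positive-negative pairs are ranked correctly, so the point estimate is 0.875. With only eight cases the interval is very wide, which is why sample size matters when comparing a model against a clinician panel.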

Similar resources

Prediction of Breast Tumor Malignancy Using Neural Network and Whale Optimization Algorithms (WOA)

Introduction: Breast cancer is the most prevalent cause of cancer mortality among women. Early diagnosis of breast cancer gives patients greater survival time. The present study aims to provide an algorithm for more accurate prediction and more effective decision-making in the treatment of patients with breast cancer. Methods: The present study was applied, descriptive-analytical, based on the ...

Comparison of artificial neural network and parametric regression models in predicting the survival of patients with gastric cancer

Background & Objective: Using parametric models is a common approach in survival analysis. In recent years, artificial neural network (ANN) models have increasingly been used in survival prediction. The aim of this study was to predict the survival rate of patients with gastric cancer using parametric regression and ANN models and to compare these methods. Methods: We used the data of 436 gast...

Survival from skin cancer and its associated factors in Kurdistan province of Iran

Background: We explored survival from skin cancer and its determinants in Kurdistan province of Iran. Methods: In a retrospective cohort design, we identified all registered skin cancer patients in Kurdistan Cancer Registry from 2000 to 2009. Information on time and cause of death was obtained from the Registrar’s office and information on type, stage and anatomic locations were extracted fr...

Development of an Ensemble Multi-stage Machine for Prediction of Breast Cancer Survivability

Prediction of cancer survivability using machine learning techniques has become a popular approach in recent years. In this regard, an important issue is that preparation of some features may need conducting difficult and costly experiments while these features have less significant impacts on the final decision and can be ignored from the feature set. Therefore, developing a machine for p...

Does ethnicity affect survival following colorectal cancer? A prospective, cohort study using Iranian cancer registry

Background: The present study compared the differences between survivals of patients with colorectal cancer according to their ethnicity adjusted for other predictors of survival. Methods: In this prospective cohort study patients were followed up from definite diagnosis of colorectal cancer to death. Totally, 2431 person-year follow-ups were undertaken for 1127 colorectal cancer patients on...

Journal title:

Volume 4, Issue 

Pages -

Publication date: 2014